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Abstract: We study the Monge transportation problem when the cost is the action associated 
to a Lagrangian function on a compact manifold. We show that the transportation can be 
interpolated by a Lipschitz lamination. We describe several direct variational problems the 
minimizers of which are these Lipschitz laminations. We prove the existence of an optimal 
transport map when the transported measure is absolutely continuous. We explain the relations 
with Mather's minimal measures. 



Resume: On etudie le probleme de transport de Monge lorsque le cout est Taction associee 
a un Lagrangien sur une variete compacte. On montre que le transport pent etre interpole 
par une lamination lipschitzienne. On decrit plusieurs problemes variationnels directs dont ces 
laminations sont les minimiseurs. On montre I'existence d'une application de transport opti- 
male lorsque la mesure transportee est absolument continue. On explique les relations avec les 
mesures minimisantes de Mather. 
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Several observations have recently renewed the interest for the classical topic of optimal 
mass transportation, whose primary origin is attributed to Monge a few years before French 
revolution. The framework is as follows. A space M is given, which in the present paper will 
be a compact manifold, as well as a continuous cost function c{x, y) : M x M — > M. Given two 
probability measures /io and fii on M, the mappings ^ : M — > M which transport /xq into /ii 
and minimize the total cost c(x, ^'(x))d/io are studied. It turns out, and it was the core of 
the investigations of Monge, that these mappings have very remarkable geometric properties, at 
least at a formal level. 

Only much more recently was the question of the existence of optimal objects rigorously 
solved by Kantorovich in a famous paper of 1942. Here we speak of optimal objects, and not of 
optimal mappings, because the question of existence of an optimal mapping is ill-posed, so that 
the notion of optimal objects has to be relaxed, in a way that nowadays seems very natural, and 
that was discovered by Kantorovich. 

Our purpose here is to continue the work initiated by Monge, recently awakened by Brenier 
and enriched by other authors, on the study of geometric properties of optimal objects. The 
costs functions we consider are natural generalizations of the cost c{x,y) = d{x,y)'^ considered 
by Brenier and many other authors. The book [39] gives some ideas of the applications expected 
from this kind of questions. More precisely, we consider a Lagrangian function L{x,v,t) : 
TM X M — > M which is convex in v and satisfies standard hypotheses recalled later, and define 
our cost by 

c{x,y) = min / L{'y{t),'y{t),t)dt 

-y Jo 

where the minimum is taken on the set of curves 7 : [0, 1] — > M satisfying 7(0) = x and 
7(1) = y. Note that this class of costs does not contain the very natural cost c{x,y) = d{x,y). 
Such costs are studied in a second paper [9]. 

Our main result is that the optimal transports can be interpolated by measured Lipschitz 
laminations, or geometric currents in the sense of Ruelle and Sullivan. Interpolations of trans- 
port have already been considered by Benamou, Brenier and McCann for less general cost 
functions, and with different purposes. Our methods are inspired by the theory of Mather, 
Mane and Fathi on Lagrangian dynamics, and we will detail rigorously the relations between 
these theories. Roughly, they are exactly similar except that mass transportation is a Dirichlet 
boundary value problem, while Mather theory is a periodic boundary value Problem. We will 
also prove, extending works of Brenier, Gangbo, McCann, Carlier, and other authors, that the 
optimal transportation can be performed by a Borel map with the additional assumption that 
the transported measure is absolutely continuous. 

Various connections between Mather- Fathi theory, optimal mass transportation and Hamilton- 
Jacobi equations have recently been discussed, mainly at a formal level, in the literature, see for 
example [39], or [19], where they are all presented as infinite dimensional linear programming 
problems. This have motivated a lot of activity around the interface between Aubry-Mather 
theory and optimal transportation, some of which overlap partly the present work. For example, 
at the moment of submitting the paper, we have been informed of the existence of the recent 
preprints of De Pascale, Gelli and Granieri, [15j . and of Granieri, [26]. We had also been aware 
of a manuscript by Wolansky [40j for a few weeks, which, independently, and by somewhat 
different methods, obtains results similar to ours. Note however that Lipschitz regularity, which 
we consider as one of our most important results, was not obtained in this preliminary version 
of [40j . It is worth also mentioning the papers [36] of Pratelli and [31] of Loeper. 

This paper emanates from the collaboration of the Authors during the end of the stay of the 
first author in EPFL for the academic year 2002-2003, granted by the Swiss National Science 
Foundation. 
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1 Introduction 

We present the context and the main results of the paper. 
1.1 Lagrangian, Hamiltonian and cost 

In ah the present paper, the space M will be a compact and connected Riemannian manifold 
without boundary. Some standing notations are gathered in the appendix. Let us fix a positive 
real number T, and a Lagrangian function 

L G C'^iTM X [0,T],R). 

A curve 7 € C^([0,T],M) is called an extremal if it is a critical point of the action 

f L{^{t),jit),t)dt 
Jo 

with fixed endpoints. It is called a minimizing extremal if it is minimizing the action. We 
assume: 

convexity For each {x,t) £ M x [0,T], the function v 1 — > L{x,v,t) is convex with positive 
definite Hessian at each point. 

superlinearity For each {x,t) G M x [0,T], we have w, || — > 00 as — > 00. 
Arguing as in [20j (Lemma 3.2.2), this implies that for all a > there exists C > such that 
L{x,v,t) ^ a||t;|| - C for ah {x,v,t) G TM x [0,r]. 

completeness For each {x,v,t) € TM x [0, T], there exists one and only one extremal 7 G 
C^{[0,T],M) such that (7(t),7(t)) = {x,v). 

We associate to the Lagrangian L a Hamiltonian function H G C^{T*M x [0, r],M) given by 

H{x,p, t) = maxp(t>) — L{x, v, t). 

V 

We endow the cotangent bundle T*M with its canonical symplectic structure, and associate to 
the Hamiltonian H the time-dependent vectorfield Y on T*M, which is given by 

Y = {dpH, -d^H) 

in any canonical local trivialisation of T*M. The hypotheses on L can be expressed in terms of 
the function H: 

convexity For each {x,t) G Af x [0,T], the function p 1 — > H{x,p,t) is convex with positive 
definite Hessian at each point. 

superlinearity For each {x,t) G M x [0, T], we have H{x,p,t)/\\p\\ — > 00 as — > 00. 
completeness Each solution of the equation (x{t),p{t)) = Y{x{t),p{t),t) can be extended to 
the interval [0, T]. We can then define, for all s, t G [0, T], the flow of Y from times s to time 
t. 

In addition, the mapping d^L : TM x [0,r] — > T*M x [0,r] is a diffeomorphism, 
whose inverse is the mapping dpH. These diffeomorphisms conjugate Y with a time-dependent 
vectorfield E on TM. We denote the flow of E hy Tpl : TM — > TM {s,t G [0,T]), which is 
such that i/jg = Id and dt^l^l = EfO ip^, where as usual Et denotes the vectorfield E{.,t) on TM. 
The diffeomorphisms d^L and dpH conjugate the flows and 93*. Moreover the extremals are 
the projection of the integral curves of E and 

(vroV^^t^oV'*)) =V*, (1) 



3 



where vr : TM M is the canonical projection. In ([T]), ^^(vr o is seen as a vector in the 
tangent space of M at vr o t/;*. If dt{iT o is seen as a point in TM, ([1]) becomes simply 

For each ^ s < t ^ T, we define the cost function 

c*(x,y) = min/ L{j{a),'y{a),a)da 

where the minimum is taken on the set of curves 7 G C^([s,t],M) satisfying 7(5) = x and 
7(t) = y. That this minimum exists is a standard result under our hypotheses, see [33] or [2U] . 

Proposition 1. Lei us fix a subinterval [s,t] C [0,T]. The set £ C C'^{[s,t\, M) of minimizing 
extremals is compact for the topology. 

Let us mention that, for each {xo,s) € M x [0,T], the function {x,t) 1 — > c*(xo,x) is a 
viscosity solution of the Hamilton-Jacobi equation 

dfU + H{x, dxU, t) = 

on Mx]s,T[. This remark may help the reader in understanding the key role which will be 
played by this equation in the sequel. 



1.2 Monge-Kantorovich theory 

We recall the basics of Monge-Kantorovich duality. The proofs are available in many texts on 
the subjects, for example [U [371 [39]. We assume that M is a compact manifold and that c(x,y) 
is a continuous cost function on M x M, which will later be one of the costs c* defined above. 
Given two Borel probability measures fiQ and fii on M, a transport plan between //q and /Ui is 
a measure on M x M which satisfies 

7rotj(r/) = fiQ and 7rn(r/) = fii, 

where ttq : M x M — > M is the projection on the first factor, and tti is the projection on the sec- 
ond factor. We denote by IC{fiQ , fii) , after Kantorovich, the set of transport plans. Kantorovich 
proved the existence of a minimum in the expression 

C(^0;/"i)= ™in / cdr] 

for each pair (^0)/^i) of probability measures on M. Here we will denote by 

Cl{fio,fJ-i) := min / cl{x,y)dr]{x,y) (2) 

r)G/C{/io,Ati) JMxM 

the optimal value associated to our family of costs c* The plans which realize this minimum 
are called optimal transfer plans. A pair (0o, of continuous functions is called an admissible 
Kantorovich pair if is satisfies the relations 

c/)i{x) = min 00 (y) + c{y,x) and (/)o(x) = max0i(y) -c{x,y) 

y^M y£M 

for all point x E M. Note that the admissible pairs are composed of Lipschitz functions if the 
cost c is Lipschitz, which is the case of the costs c* when s < t. Another discovery of Kantorovich 
is that 

C(/io,/ii) = max ( / (pidfii- / ^o^^^o) (3) 

</'0,</'i ^ J M J M ' 
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where the maximum is taken on the set of admissible Kantorovich pairs ((/>o,(/>i). This maxi- 
mization problem is called the dual Kantorovich problem, the admissible pairs which reach this 
maximum are called optimal Kantorovich pairs. The direct problem ([2]) and dual problem ([3]) 
are related as follows. 

Proposition 2. If rj is an optimal transfer plan, and if is a Kantorovich optimal pair, 

then the support of r] is contained in the set 

{{x,y) e such that (j)i{y) - (j)Q{x) = c{x,y)} C M x M. 

Let us remark that the knowledge of the set of Kantorovich admissible pairs is equivalent to 
the knowledge of the cost function c. 

Lemma 3. We have 

c{x,y) = max (pi{y) - (/)o{x) 
where the maximum is taken on the set of Kantorovich admissible pairs. 

Proof. This maximum is clearly less that c{x, y). For the other inequality, let us fix points xq 
and yo in M, and consider the functions (j)i{y) = c{xo,y) and (j)o{x) = maXy^M (t'liv) ~ c{x,y). 
We have <f)i{yo) — 4>o{xo) = c(xo, yo) — = c(xo, yo)- So it is enough to prove that the pair (^o, (pi) 
is an admissible Kantorovich pair, and more precisely that <j)i{y) = min^^gA/ (Pq{x) + c(x, y). We 
have 

4>o{x) + c{x,y) ^ c{xo,y) - c{x,y) + c{x,y) ^ c(xo,y) = (piiy) 
which gives the inequality (j)i{y) ^ min^^gA/ (poix) + c(x, y). On the other hand, we have 

mincpoix) + c{x,y) ^ (po{xo) +c{xo,y) = c{xo,y) = (pi{y). 

□ 

1.3 Interpolations 

In this section, the Lagrangian L and time T > are fixed. It is not hard to see that, if /ii,^2 
and /i3 are three probability measures on M, and if ti ^ ^2 ^ ^3 £ [0, T] are three times, then 
we have the inequality 

The family fit,t € [0,T] of probability measures on M is called an interpolation between /ig and 
fiT if it satisfies the equality 

for all ^ ti ^ ^2 ^ ^3 ^ ^- Our main result is the following: 

Theorem A. For each pair /^O) /^T of probability measures, there exist interpolations between hq 
and ht- Moreover, each interpolation {^t),t £ [0,7"] is given by a Lipschitz measured lamination 
in the following sense: 

Eulerian description : There exists a bounded locally Lipschitz vectorfield X{x,t) : Mx]0, T[ — 
TM such that, g]0, is the flow of X from time s to time t, then {"^D^fis = /^t for 

each {s,t) G]0,rp. 

Lagrangian description : There exists a family T C C^([0, r],M) of minimizing extremals 
7 of L, which is such that the relation ^{t) = X{'y{t),t) holds for each t g]0,T[ and for each 
7 € .F. The set 

^ = {{l{t),^{t),t),t G]0, T[, 7 G n C TMx]0, T[ 
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is invariant under the Euler- Lagrange flow ip. The measure fit is supported on % = {7(t),7 € 
T}. In addition, there exists a continuous family mt,t € [0, T] of probability measures on TM 
such that mt is concentrated onTt = {(7(t), 7(i)), 7 G J^} for each t g]0, T[, such that n^rrit = Ht 
for each t € [0,T], and such that 

for all {s,t) € [0,T]2. 

Hamilton-Jacobi equation : There exists a Lipschitz and function v{x, t) : Mx]0, T[ — > M 
which satisfies the inequation 

dtv + H{x,dxV,t) ^ 0, 

with equality if and only if {x,t) £ T = {{j{t),t),^ £ J^,t €]0,T[}, and such that X{x,t) = 
dpH{x, dxv{x,t),t) for each {x,t) £ T. 

Uniqueness : There may exist several different interpolations. However, it is possible to choose 
the vectorfield X, the family T and the sub-solution v in such a way that the statements above 
hold for all interpolations fit with these fixed X, T and v. For each s < t €]0,T[, the measure 
(Id X ^D^fig is the only optimal transport plan in IC^fisTfJ-t) for the cost c*. This implies that 

/ cl{x,¥,{x))dfisix) = Cl{fls,f^t)- 
JM 

Let us comment a bit the preceding statement. Tlie set T C rMx]0,T[ is the image by 
the Lipschitz map {x,t) {X{x,t),t) of the set T C rMx]0,r[. We shah not take X{x,t) = 
dpH{x, dxv{x, t),t) outside of T because we do not prove that this vectorfield is Lipschitz outside 
of T. The data of the vectorfield X outside of T is immaterial: any Lipschitz extension of X^q- 
will fit. Note also that the relation 

VI/* = vr o o X, (4) 

holds on Ts, where Xs{.) = X{., s). 

The vectorfield X in the statement depends on the transported measures fiQ and fiT- The 
Lipschitz constant of X, however, can be fixed independently of these measures, as we now state 
(see Proposition 1131 Proposition 1191 Theorem [3] and (llip ): 

Addendum There exists a decreasing function K{e) :]0, T/2[ >]0, oo[, which depends only on 

the time T and on the Lagrangian L, and such that, for each pair fiQ, fJ,T of probability measures, 
one can choose the vectorfield X in Theorem A in such a way that X is K{e)-Lipschitz on 
M X [e, T - e] for each e g]0, T/2[. 

Proving Theorem A is the main goal of the present paper. We will present in section [2] 
some direct variational problems which are well-posed and of which the transport interpolations 
are in some sense the solutions. We believe that these variational problems are interesting in 
themselves. In order to describe the solutions of the variational problem, we will rely on a dual 
approach based on the Hamilton-Jacobi equation, inspired from Fathi's approach to Mather 
theory, as detailed in section [3j The solutions of the problems of section [21 as well as the 
transport interpolations, are then described in section [U which ends the proof of Theorem A. 

1.4 Case of an absolutely continuous measure fiQ 

Additional conclusions concerning optimal transport can usually be obtained when the initial 
measure fiQ is absolutely continuous. For example a standard question is whether the optimal 
transport can be realized by an optimal mapping. 
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A transport map is a Borel map ^ : M — > M which satisfies ^'j/Uq = /^i- To any transport 
map ^ is naturally associated the transport plan (Id x ^)^fiQ, called the induced transport plan. 
An optimal map is a transport map ^ : M — > M such that 



CT{x,'^{x))dfio / CT{x,F{x))dfio 

M J M 

for any transport map F. It turns out that, under the assumption that /ig has no atoms, a 
transport map is optimal if and only if the induced transport plan is an optimal transport plan, 
see [1], Theorem 2.1. In other words, we have 



inf / c{x,'^{x))d^Q{x) = C{fj,o,fii), 

where the infimum is taken on the set of transport maps from fiQ to ^i. This is a general result 
which holds for any continuous cost c. It is a standard question, which turns out to be very 
hard for certain cost functions, whether the infimum above is reached, or in other words whether 
there exists an optimal transport plan which is induced from a transport map. Part of the result 
below is that this holds true in the case of the cost Cg . The method we use to prove this is an 
elaboration on ideas due to Brenier, see [12] and developed for instance in [24], (see also [23] ) 
and [T6], which is certainly the closest to our needs. 

Theorem B. Assume that fj,Q is absolutely continuous with respect to the Lebesgue class on M . 
Then for each final measure fix, there exists one and only one interpolation fit,t £ [0, T], and 
each interpolating measure iJ,t,t < T is absolutely continuous. In addition, there exists a family 
^'q : M ^ M, t g]0,T], of Borel maps such that (Id x ^'q)j^o is the only optimal transfer plan 
in /C(//o,A*t) for the cost function Cq. Consequently, we have 

[ c*(x,**(x))(i/io(x) = C*(^o,Mi), 0<t^T. 

If Ht, instead of hq, is assumed absolutely continuous, then there exists one and only one inter- 
polation, and each interpolating measure nt,t G]0,T] is absolutely continuous. 

This theorem will be proved and commented in section [5l 

1.5 Mather theory 

Let us now assume that the Lagrangian function is defined for all times, L E C'^{TM x 
and, in addition to the standing hypotheses, satisfies the periodicity condition 

L(x, f + 1) = L(x, V, t) 

for all [x^v^t) E TM x M. A Mather measure, see [33], is a compactly supported probability 
measure mo on TM which is invariant in the sense that (V'o)tt™'0 = "^o and is minimizing the 
action 

Al{mo) = L{'4)l{x,v),t)dmQdt. 

JTAfx[0,l] 

The major discovery of [33] is that Mather measures are supported on the graph of a Lipschitz 
vectorfield. Let us call a the action of Mather measures -this number is the value at zero of the 
a function defined by Mather in [33]. Let us now explain how this theory of Mather is related 
to, and can be recovered from, the content of our paper. 
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Theorem C. We have 

a = min Cn (/i, /i), 

where the minimum is taken on the set of probability measures on M . The mapping mo i — > vrjjmo 
is a bisection between the set of Mather measures tuq and the set of probability measures fj, on M 
satisfying Cq[^^^) = a. There exists a Lipschitz vectorfield Xq on M such that all the Mather 
measures are supported on the graph of Xq . 

This theorem will be proved in sectional where the bijection between Mather measures and 
measures minimizing Co(/i, /x) will be precised. 



2 Direct variational problems 

We state two different variational problems whose solutions are the interpolated transports. 
We believe that these problems are interesting on their own. They will also be used to prove 
Theorem A. 



2.1 Measures 

This formulation parallels Mather's theory. It can also be related to the generalized curves of 
L. C. Young. Let /ig and ^jlt be two probability Borel measures on M. Let mo G Bi{TM) be a 
Borel probability measure on the tangent bundle TM. We say that mo is an initial transport 
measure if the measure r] on M x M given by 

r]= {tt X {it o^p^))^mo 

is a transport plan, where vr : TM — > M is the canonical projection. We call I{^q,^t) the set 
of initial transport measures. To an initial transport measure mo, we associate the continuous 
family of measures 

= {'>Po)^'mQ,t G [o,r] 

on TM, and the measure m on TM x [0, T] given by 

m = mt ^ dt = [{'ipQ)^mo) (g) dt. 

Note that the linear mapping mo i — > m = {('ipQ)^mQ)®dt is continuous from B{TM) to B{TM x 
[0,T]) endowed with the weak topology, see appendix. 

Lemma 4. The measure m satisfies the relation 

/ dtfix,t) + dxf{x,t)-vdm{x,v,t)= / frdfiT - / fodfio (5) 

Jtmx[o,t] Jm Jm 

for each function f € C^{M x [0,T],M), where ft denotes the function x i — > f{x,t). 

Proof. Setting f{x,v,t) = f{x,t), gi{x,v,t) = dtf{x,t) = dtf{x,v,t) and g2{x,v,t) = 
dxf{x, t) ■ V, we have 

/ dtf{x,t) + dxf{x,t)-vdm{x,v,t)= / {gi + g2) o ijj\^dmQdt. 

JTMx[0,T] Jo JTM 

Noticing that, in view of equation ([1]), we have 

dtiJo V'o) = 51 o V'o + 52 o V'o 
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we obtain that 

/ dtfix,t) + dxfix,t) ■vdm{x,v,t) = {f o Tp^ - f)dmo = frd^T - fodf^o 

JtMx[0,T] JTM JM JM 

as desired. □ 

Definition 5. A finite Borel measure on TAl x [0, T] which satisfies is called a transport 
measure. We denote by A4{ij,q, fix) the set of transport measures. A transport measure which is 
induced from an initial measure rriQ is called an invariant transport measure. The action of the 
transport measure m is defined by 



A{m) = / L{x,v,t)dm eRU {00} 

JTMxlO.T] 



lTMx[0,T] 

The action A(mQ) of an initial transport measure is defined as the action of the associated 
transport measure m. We will also denote this action by vIq (mo) when we want to insist on the 
time interval. We have 

^0 ("T-o) = / L{'ipl{x,v),t)dmQdt. 

JTMx[0,T] 

Notice that initial tranport measures exist: 

Proposition 6. The mapping (vr x (t^oiPq))^ '■ T{fiQ, ^t) — ^ ^(M0)/^t) is surjective. In 
addition, for each transport plan r], there exists a compactly supported initial transport measure 
ruQ such that (vr x (vr o V'o"))tt"''0 = ^ o^i^d such that 



M^o) = Cq {x,y)dri. 

JMxM 

Proof. By Proposition [H there exists a compact set K € TM such that if ^{t) : [0,r] — > M 
is a minimizing extremal, then the lifting (7(t),7(i)) is contained in K for each t G [0,T]. We 
shall prove that, for each probability measure r] G B{M x M), there exists a probability measure 
m-o G B{K) such that (vr x (vr o il;'^))^mQ = rj and such that 



^("T-o) = / Co {x,y)dri. 

JMxM 



Observing that 



• the mappings mo — > (vr x (tt o ij)Q))^mQ and mo 1 — > ^(mo) are linear and continuous on 
the space Bi{K) of probability measures supported on K, 

• the set Bi{K) is compact for the weak topology, and the action A is continuous on this 

set, 

• the set of probability measures on M x M is the compact convex closure of the set of Dirac 
probability measures (probability measures supported in one point), see e. g. [TO], p. 73, 

it is enough to prove the result when r/ is a Dirac probability measure (or equivalently when ^uq 
and /iT are Dirac probability measures). Let r] be the Dirac probability measure supported at 
(a;o,a;i) G M x M. Let 7(t) : [0,T] — > M be a minimizing extremal with boundary conditions 
7(0) = xq and 7(T) = xi. In view of the choice of K, we have (7(0), 7(0)) G K. Let mo be the 
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Dirac probability measure supported at (7(0), 7(0)). It is straightforward that ruf is then the 
Dirac measure supported at (7(t),7(t)), so that 

^("^o) = / Ldmtdt = / L{-j{t),j{t),t)dt = Cq {xo,xi) = / c^dr] 
Jo Jo JmxM 

and 

(vr X (vr o Vif ))t)"^o = V- 

□ 

Although we are going to build minimizers by other means, we believe the following result 
is worth being mentioned. 

Lemma 7. For each real number a, the set M^{fiQ, fix) of transport measures m which satisfy 
A{m) ^ a, as well as the set I"'{fio, ht) of initial transport measures tuq which satisfy A^{mQ) ^ 
a, are compact. As a consequence, there exist optimal initial transport measures, and optimal 
transport measures. 

Proof. This is an easy application of the Prohorov theorem, see the Appendix. q 

Now we have seen that the problem of finding optimal transport measures is well-posed, let 
us describe its solutions. 

Theorem 1. We have 

C(f (/io, /Ut) = min A{m) = min A{mQ). 

■meM(po,t^T) moe2:(/io,/iT) 

The mapping 

mo I — > m = ((V'o)tt"i-o) <X) dt 

between the set OX of optimal initial measures and the set OA4 of optimal transport measures is 
a bisection. There exists a bounded and locally Lipschitz vectorfield X{x,t) : Mx]0,T[ — > TM 
such that, for each optimal initial measure rriQ € OZ, the measure mt = (V'o)j)"io is supported 
on the graph of Xt for each t g]0,T[. 

The proof will be given in section 14. 3[ Let us just notice now that the inequalities 

Cq {^0, Ht) ^ min A{mQ) ^ min A{m) 

hold in view of Proposition [HI 



2.2 Currents 

This formulation finds its roots on one hand in the works of Benamou and Brenier, see [6], and 
then Brenier, see [13], and on the other hand in the work of Bangert f5]. Let n^{M x [0,r]) be 
the set of continuous one-forms on M x [0,T], endowed with the uniform norm. We will often 
decompose forms uj £ Q^{M x [0, T]) as 

UJ = LJ^ + uj^dt, 

where to^ is a time-dependent form on M and is a continuous function on M x [0,T]. To 
each continuous linear form x on il^{M x [0, T]), we associate its time component /z^, which is 
the measure on M x [0, T] defined by 

/ fdfJ-x = xifdt) 

JMx[0,T] 



10 



for each continuous function / on Af x [0,T]. A Transport current between /xq ^iid is 
continuous linear form x ^^{M x [0,T]) which satisfies the two conditions: 

1. The measure ^-^ is non negative (and bounded). 

2. dx = fJ-T ^ — fJ'O ^ ^0^ which means that 



x{df) = / frdfiT - / fodno 
Jm Jm 



for each smooth (or equivalently C^) function f : M x [0,T] — > R. 

We cah C{ho,ij,t) the set of transport currents from hq to fiT- The set C{^o,^t) is a closed 
convex subset of [r2'^(M x [0,r])]*. We will endow C{^q,iit) with the weak topology obtained 
as the restriction of the weak-* topology of [r2'^(M x [0,T])]*. Transport currents should be 
thought of as vector fields whose components are measures, the last component being /i^. 

If Z is a bounded measurable vectorfield on M x [0,T], and if is a finite non-negative 
measure on M x [0, T], we define the current Z A by 



Z ^v{uJ) := / uj{Z)dv. 

JMx[0,T] 

Every transport current can be written in this way, see p2] or [25j. As a consequence, currents 
extend as linear forms on the set ^^{M x [0, T]) of bounded measurable one- forms. If / is 
a Borel subset of the interval [0,T], it is therefore possible to define the restriction xi of the 
current x to / by the formula Xl{'^) = x(l/'^)) where 1/ is the indicatrix of /. 

Lemma 8. If x is a transport current, then 

where r is the projection onto [0,T], see appendix. As a consequence, there exists a measurable 
familty ^t,t G]0,T[ of probability measures on M such that = fit dt, see appendix. There 
exists a set I c]0, T[ of total measure such that the relation 

ftd^t = / fodfiQ + X[o,t[{df) (6) 
M Jm 

holds for each function f : M x [0, T] — y M and each t G I. 

Proof. Let g : [0,T] — > M be a continuous function. Setting G{t) = fl^g{s)ds, we observe 
that 



/ gdf,^ = xidG)= [ GrdfLT- [ Godfio = GiT) - 0(0) = f g[ 
Jmx\o,t] Jm Jm Jo 



s)ds. 



This implies that t^h^ = dt. As a consequence, the measure ^j^. can be desintegrated as = 
fj,t®dt. We claim that, for each function / : Mx[0,T] — > M, the relation ([6]) holds for almost 
every t. Since the space C^{M x [0,T],M) is separable, the claim implies the existence of a set 
I C]0, T[ of fuh Lebesgue measure such that © holds for alH G / and all / G C^(M x [0, T], R). 
In order to prove the claim, let us fix a function / in C^(M x [0,T],M). For each function 
g e CH[0,r],R), we have 

x{d{gf)) = xig'fdt) + x{gdf) 

hence ^ 

g{T) f frdf^T-giO) [ /od^o = / g'it) [ ftdfitdt + xigdf). 
Jm Jm Jo Jm 
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By applying this relation to a sequence of functions g approximating l[o,j[, we get, at the 
limit 



fodfiQ = - ftdfit + X[o,t[{df) 

M JM 

at every Lebesgue point of the function t — > /^^ ftd^t ■ □ 

If /io = fJ'T, an easy example of transport current is given by xl"^) = Jm lo ^^^^^t^^- Here 
are some more interesting examples. 

Regular transport currents. The transport current x is called regular if there exists a 
bounded measurable section X of the projection TM x [0,T] — > M x [0, T], and a a non- 
negative measure on M x [0, T] such that x = 1) ^ ^- The time component of the current 
{X, 1) A is fi. In addition, if {X, 1) A ^ = {X' , 1) A fi for two vectorfields X and X' , then X 
and X' agree /^-almost everywhere. 

The current x = 1) A^, with X bounded, is a regular transport current if and only if there 
exists a (unique) continuous family fit G Si{M),t € [0,T] (where fiQ and afe the transported 
measures) such that fiy. = fit ® dt and such that the transport equation 

dtfit + dx.{Xfit) = 

holds in the sense of distributions on Mx]0,T[. The relation 



ftdfit - I fsdfis = Xls,t[idf) 

M JM 

then holds for each function f and each s ^ t in [0,T]. 

In order to prove that the family fit can be chosen continuous, pick a function / G C^(M, M) 
and notice that the equation 

/ fdfit - / fdfis = X[s,t[idf) = df ■ X^dfiada 

JM JM Js JM 

holds for all s ^ t in a subset of total measure / C [0,T]. Note that this relation also holds if 
s = and t G / and if s G / and t = T. Since the function a i — > Jj^j df ■ X^jdfi^ is bounded, we 
conclude that the function 1 1 — > fj^j- fdfit is Lipschitz on / U {0, T} for each / G C^(M, M), with 
a Lipschitz constant which depends only on ||ci/||oo ■ ||^||oo- The family fit is then Lipschitz on 
/ U {0, T} for the 1-Wasserstein distance on probability measures, see |39l [T71 H] for example, 
the Lipschitz constant depending only on |[X||oo- It suffices to remember that, on the compact 
manifold M, the 1-Wasserstein distance on probabilities is topologically equivalent to the weak 
topology, see for example [H], (48.5) or [39] . 

Smooth transport currents. A regular transport current is said smooth if it can be written 
on the form (X, 1) A A with a bounded vectorfield X smooth on Mx]0,T[ and a measure A 
that has a positive smooth density with respect to the Lebesgue class in any chart in Mx]0, T[. 
Every transport current in C{fiQ,fiT) can be approximated by smooth transport currents, but 
we shall not use such approximations. 

Lipschitz regular transport currents. A regular transport current is said Lipschitz regular 
if it can be written in the form {X, 1) A fi with a vectorfield X which is bounded and locally 
Lipschitz on Mx]0,T[. Smooth currents are Lipschitz regular. Lipschitz regular transport 
currents have a remarkable structure: 

Ux = (^) ^) Afi is a Lipschitz regular transport current with X bounded and locally Lipschitz 
on Mx]0, T[, then we have 

= l^t 



12 



where (s,t) g]0, T[^, denotes the flow of the Lipschitz vectorfield X from time s to time t, 
and nt is the continuous family of probability measures such that /i^ = /i^ (g) dt. 

This statement follows from standard representation results for solutions of the transport 
equation, see for example [3] or [3]. 

Transport current induced from a transport measure. To a transport measure m, we 
associate the transport current Xm defined by 



Xm(t^) = / {iL!^{x,t) ■ V + uj''{x,t))dm{x,v,t) 

Jtmx[o,t] 

where the form uj is decomposed as a; = uj^ + oj^dt. Note that the time component of the current 
Xm is TTjjm. We will see in Lemma [TT] that 

A{xm) ^ A{m) 

with the following definition of the action A{x) of a current, with equality if m is concentrated 
on the graph of any bounded vectorfield M x [0, T] — > TM. 

Lemma 9. For each transport current x, the numbers 

Ai{x)= sup (x(t^'',0)- / H{x,u;''{x,t),t)dn^ 
Loeno ^ Jmx[o,t] 

A2{x) = sup (xiuj) - / {H {x, u'' {x, t), t) dfi^ 

Loeno ^ Jmx[o,t] 

A^ix) = sup (xiuj) - T sup 

wen" ^ {x,t)eMx[o,T] 

A4{x) = sup x{^) 

Aaix) = sup xif^), 

are equal. In addition the number obtained by replacing in the above suprema the set $7' 

of continuous forms by the set r^oo of bounded measurable forms also have the same value. 







The last remark in the statement has been added in the last version of the paper and is 
inspired from [15]. 

Proof. It is straightforward that Ai = A2, this just amounts to simplifying the term J co^dji-^. 
Since ^-^ is a non-negative measure which satisfies /j\/x[ot] ^'^l^x ~ ^® have 

/ {H{x,uj''{x,t),t) +uj*)dn^i^T sup + w*) 

Jmx[Q,T] {x,t)eMx[0,T] 

SO that A^i^x) ^ A2{x)- In addition, we obviously have A^{x) ^ ^4(x) ^ ^3(x)- Now notice, in 
A2, that the quantity 



xM-/ {H{x,uj''{x,t),t)+uj^)dfi^ 
JMx[0,T] 

does not depend on w*. Let us consider the form cD = {uj^ , — H {x , u!'^ , t)) , which satisfies the 
equality H{x^uj^,t) = 0. We get, for each form uj, 



JMx[0,T] 
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Hence Ai{x) ^ A^{x)- Exactly the same proof shows that the numbers Af^{x) ^'^'^ equal. In 
order to end the proof, it is enough to check that ^2(x) = ^2°(x)- Writing the current x on the 
form Z Au with a bounded vectorfield Z and a measure u G B^{M x [0,r]), we have 

A2{x) = sup / Lo{Z)di^- / {H{ 

weno ^Jmx[o,t] Jmx[o,t] ' 

and 

A^{X)= sup ( / uj{Z)dv- [ {Hix,LO^ix,t),t)+J)d^i^. 
weHoo ^Jmx[o,t] Jmx[o,t] ' 

The desired result follows by density of continuous functions in L^(v + n-^. q 



Definition 10. We denote by A{x) o,nd call action of the transport current x the common value 
of the numbers Ai (x) ■ 

The existence of currents of finite action follows from the following: 

Lemma 11. We have 

A{x)= [ L{x,X{x,t),t)dii 

JMx[Q,T] 

for each regular current x = 1) A/x. If m is a transport measure, and if Xm is the associated 
transport current, then A{xm) ^ ^(tz), with equality ifm is supported on the graph of a bounded 
Borel vectorfield. As a consequence, we have the inequalities 

Cq {/iQ, fix) ^ min A{mo) ^ min A{m) min ^(x)- 

m()GX(/io,/iT) meM{no,HT) x&C{ho,Ht) 

Proof. For each bounded measurable form u, we have 

/ uj^{X) - H{x,uj^{x,t),t)diJ. / L{x,X{x,t),t)dix, 



so that 



A{{X, 1) A /x) ^ / L{x, X{x, t),t)dn. 



; [0,T] 

On the other hand, taking the form uJq{x, t) = dvL{x, X{x, t), t) we obtain the pointwise equality 

L(x, X{x, t), t) = ul{X) - H{x, u^{x, t),t) 

and by integration 

/ L{x,X{x,t),t)dii = u}q{X) - H{x,Lv'Q{x,t),t)dii ^ A{{X,1) A ii). 

Jmx[o,t] Jmx[o,t] 

This ends the proof of the equality of the two forms of the action of regular currents. Now if 
Xm is the current associated to a transport measure m, then we have, for each bounded form 
CO e n^{M X [0,T]), 

(w) - / u;\x,t) + H{x,uj^{x,t),t)dnj^ = / u^{v) - H{x,uj'' {x,t),t)dm 

Jmx[o,t] Jtmx[o,t] 



Xm 

by definition of Xm, so that 



A{Xm) ^ / L{x,v,t)dm = A{m) 

Jtmx[o,t] 
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by the Legendre inequality. In addition, if there exists a bounded measurable vectorfield 
X : M X [0, T] — > TM such that the graph of X x r supports m, then we can consider 
the form Wq associated to X as above, and we get the equality for this form. q 

Although we are going to provide explicitly a minimum of A, we believe the following Lemma 
is worth being mentioned. 

Lemma 12. The functional A : C{^q,^t) — ^ I^U {+00} is convex and lower semi- continuous, 
both for the strong and weak-* topologies on \pP{M x [0,T])]*. Moreover it is coercive with 
respect to the strong topology and hence it has a minimum. 

Proof. First note that A{'x) < 00 if x is the transport current corresponding to an initial 
transport measure in A4{p,o, fij-) arising from a transport plan. Let us define the continuous 
convex function Mr : J^°(M x [0,r]) — > M by 

]Hr(u;)=T sup H{x,oj''{x,t),t) + ujK 

{x,t)eMx[0,T] 

Then the action is the restriction to C{fio,fiT) of the Fenchel conjugate A = M* : [^.^{M x 
[0,T])]* — > M U l+cxo}. In other words, A is the supremum over uj of the family of affine 
functionals 

that are continuous both for the strong and weak-* topologies. Hence A is convex and lower 
semi-continuous for both topologies. Since 

A{x) ^ sup xi^) - sup B.t{uj), 

\\uj\\^l ||a;||^l 

A is coercive. The existence of a minimizer is standard: any minimizing sequence (xn) is bounded 
(thanks to coercivity) and has a weakly-* convergent subsequence (because Q,^{M x [0, T]) is a 
separable Banach space). By lower semicontinuity, its weak-* limit is a minimizer. Note that 
C(/iO)Mr) is weakly* closed. □ 

Theorem 2. We have 

Co{lJ'0,f^T) = min A{x) 

where the minimum is taken on all transport currents from fiQ to ^t- Every optimal transport 
current is Lipschitz regular. Let x = 1) A n be an optimal transport current, with X locally 
Lipschitz on Mx]0,T[. The measure m = {X x T)f^fi G B+{TMx]0,T[) is an optimal transport 
measure, and x is the transport current induced from m. Here r : TM x [0, T] — > [0, T] is the 
projection on the second factor, see appendix. We have 

CQ{no,fJ'T) = A{m) = A{x) = / L{x,X{x,t),t)dny^. 

Jmx[0,T] 

This result will be proved in 14. II after some essential results on the dual approach have been 
established. 

3 Hamilton-Jacobi equation 

Most of the results stated so far can be proved by direct approaches using Mather's shortening 
Lemma, which in a sense is an improvement of the initial observation of Monge, see [33] and 
0. We shall however base our proofs on the use of the Hamilton-Jacobi equation, in the spirit 
of Fathi's approach to Mather theory, see [20] . which should be associated to Kantorovich dual 
approach of the transportation problem. 
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3.1 Viscosity solutions and semi-concave functions 

It is certainly useful to recall the main properties of viscosity solutions in connection with semi- 
concave functions. We will not give proofs, and instead refer to [20], [21], [TJ], as well as the 
appendix in [8]. We will consider the Hamilton- Jacobi equation 

dtU + H{x,d^u,t) = 0. (HJ) 

The function u : M x [0, T] — > M is called i^-semi-concave if, for each chart 6 € Q (see 
appendix), the function 

{x,t) I — >u{9{x),t) - K{\\xf + t^) 

is concave on x [0, T]. The function u is called semi-concave if it is i^-semi-concave for some 
K. A function u : Mx]0, T[ — > M is called locally semi-concave if it is semi-concave on each 
M X [s,t], for < s < t < r. The following regularity result follows from Fathi's work, see [2^ 
and also [8j. 

Proposition 13. Let ui and U2 be two K -semi-concave functions. Let A be the set of minima 
of the function ui + ti2- Then the functions ui and U2 are differentiable on A, and dui{x,t) + 
du2{x,t) = at each point of {x,t) € A. In addition, the mapping dui : M x [0, T] — > T*M is 
CK-Lipschitz continuous on A, where C is a universal constant. 

Definition 14. We say that the function u : Mx]s,t[ — > R is a viscosity solution of (HJ) if 
the equality 

u{x, a) = min u{y, () + c?(y, x) 

holds for all X ^ M and all s < Q < a < t. 

We say that the function u : Mx]s,i[ — > M is a backward viscosity solution of {HJ) if the 
equality 

u{x,a) = maxM(y,C) - 

holds for all X ^ M and all s < a < ( < t. 

We say that the function v : Mx]s, t[ — > M is a viscosity sub-solution of (HJ) if the inequality 

v{x,a) ^ v{y,C) +c'^{y,x) 

holds for all x and y in M and all s < ( < a < t. 

Finally we will say that the function v : M x [s,t] — > M is a continuous viscosity solution 
(subsolution, backward solution) of (HJ) if it is continuous on M x [s,t] and if if v\Mx]s,t[ « 
viscosity solution of (HJ) (subsolution, backward solution). 

Notice that both viscosity solutions and backward viscosity solutions are viscosity sub- 
solutions. That these definitions are equivalent in our setting to the usual ones is studied 
in the references listed above, but is not useful for our discussion. The only fact which will be 
used is that, for a function u : Mx]s,t[ — > M, being a viscosity solution (or a backward 
viscosity solution) is equivalent to being a pointwise solution of (HJ), and being a viscosity 
sub-solution is equivalent to satisfy pointwise the inequality dtu + H{x, dxU, t) ^ 0. 
Diff"erentiability of viscosity solutions. Let u G C{M x [0, T[, M) be a viscosity solution of 
(HJ) (on the interval ]0,T[). We have the expression 

u{x,t) = min u( J (0),0) -\- / L('j(a),^(a),a)da 

■y Jo 

where the minimum is taken on the set of curves 7 £ C^Hs, t],M) which satisfy the final condition 
7(i) = x. Let us denote by T{x,t) the set of minimizing curves in this expression, which are 
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obviously minimizing extremals of L. We say that p E T*M is a proximal super-differential of 
the function u : M — > R at point x if there exists a smooth function / : M — > M such that 
f — u has a minimum at x and d^f = P- 

Proposition 15. Let us fix a point {x,t) £ Mx]0,T[. The function Uf is dijjerentiable at x if 
and only if the set T{x,t) contains a single element 7, and then dxu{x,t) = dyL{x,j{t),t). 

For all {x,t) G Mx]0,T[ and j G T{x,t), we set p{s) = dvL{'y{s),'y{s), s). Then p{0) is a 
proximal sub- differential of uq at 7(0), and p{t) is a proximal super- differential of ut at x. 

We finish with an important property on regularity of viscosity solutions: 

Proposition 16. For each continuous function uq : M — > M, the viscosity solution 

u{x,t) := minno(y) + Co(y,x) 

j/GM 

is locally semi-concave on ]0, T]. // in addition the initial condition uq is Lipschitz, then u is 
Lipschitz on [0,T]. 

For each continuous function ut '■ M — > M, the viscosity solution 

u{x,t) := maxuT(y) - cf{x,y) 

is locally semi-convex on [0, r[. // in addition the final condition ut is Lipschitz, then u is 
Lipschitz on [0, T] . 

Proof. The part concerning semi-concavity of u is proved in for example. It implies that 
the function u is locally Lipschitz on ]0, T], hence differentiable almost everywhere. In addition, 
at each point of differentiability of n, we have dtu + H{x, dxU, t) = and dxu{x, t) = p{t) = 
dyL{x,'y{t),t), where 7 : [0,t] — > M is the only curve in T{x,t). In order to prove that the 
function u is Lipschitz, it is enough to prove that there exists a uniform bound on It is 

known, see Proposition [TH that p{0) := 9t,L(7(0), 7(0), 0) is a proximal sub-differential of the 
function uq at point 7(0). If uq is Lipschitz, its sub-differentials are bounded: There exists a 
constant K such that \p{0)\ ^ K. By completeness, there exists a constant K', which depends 
only on the Lipschitz constant of wq, such that \p{s)\ ^ K' for all s G [0,t]. This proves that the 
function u is Lipschitz. The statements concerning u are proved in a similar way. □ 



3.2 Viscosity solutions and Kantorovich optimal pairs 

Given a Kantorovich optimal pair {(f)Q,(f)i), we define the viscosity solution 

u{x, t) := min <po{x) + Co(y, x) 

yGM 

and the backward viscosity solution 

u{x,t) := max 01 (y) - cf{x,y) 

yeM 

which satisfy uq = uq = <j)o, and ut = ut = (pi- Note that both 0i and — 0o are semi-concave 
hence Lipschitz, that the function u is Lipschitz and locally semi-concave on ]0, T], and that the 
function il is Lipschitz and locally semi-convex on [0, T[. 



Proposition 17. We have 



C^(^o,Mt) = max ( / urd^T - / uodfio), (7) 
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where the minimum is taken on the set of continuous viscosity solutions u : M x [0, T] — > R of 
the Hamilton- Jacobi equation {HJ). The same conclusion holds if the maximum is taken on the 
set of continuous backward viscosity solutions. The same conclusion also holds if the maximum 
is taken on the set of continuous viscosity sub-solutions of {HJ) . 

Proof. If u{x,t) is a continuous viscosity sub-solution of (HJ), then it satisfies 

ut{x) - uo{y) ^ CQ{y,x) 
for each x and y G M, and so, by Kantorovich duality, 

OUTdnT - / nod/xo) Cq {ixq,ixt)- 
M Jm ^ 

The converse inequality is obtained by using the functions u and u. □ 

Definition 18. // ((/'o,</'i) is a Kantorovich optimal pair, then we denote by ^{(f)o,(f>i) C 
C^{[0,T],M) the set of curves 'jit) such that 

MliT)) = Mim + r L{j{t),j{t),t)dt. 

Jo 

We denote by T(0o,0i) C Mx]0,T[ the set 

TicPoAi) = {i7it),t),t G]0,T[,7 e 
and by t(^o,^^i) C TMx]0,T[ the set 

ncpoAi) = {i7{t),j{t),t),t G]0,r[,7 G ;^(0o, </-!)}, 
which is obviously invariant under the Euler-Lagrange flow. 

Proposition 19. Let {4>q, (pi) be a Kantorovich optimal pair, and let u and u be the associated 
viscosity and backward viscosity solutions. 

1. We have u ^ u, and 

T {(po, (pi) = {{x, t) € Mx]0, T[ such that u{x, t) = u{x, t)}. 

2. At each point {x,t) G T{(po,(pi), the functions u and u are differentiable, and satisfy 
du{x,t) = du{x,t). In addition, the mapping {x,t) i — > du{x,t) is locally Lipschitz on 
T{(po,(pi). 

3. If^{t) G J^{(po,(pi), then dxu(j{t),t) = dyL{'y{t),'y{t),t). As a consequence, the set 

T*{(po,(pi) := {{x,p,t) G T*Mx]0,T[ such that {x,t) G T and p = d:^u{x,t) = d:^u{x,t)} 

is invariant under the Hamiltonian flow, and the restriction to T{(pQ, (pi) of the projection 
TT is a bi-locally-Lipschitz homeomorphism onto its image T{(po,(pi). 

Proof. Let us fix a point {x,t) G Mx]0,T[. There exist points y and z in M such that 
u{x,t) = (po{y) + CQ(y,.x) and u{x,t) = (pi{z) — cj'{x,z), so that 

u{x,t) - u{x,t) = (po{y) - (f)i{z) + CQ{y,x) -\-cf{x,z) 

^cl{y,z)-{(Pi{z)-(Poiy))^0. 



18 



In case of equality, we must have c^(y, z) = Cg(y, x) + c[{x, z). Let 7i(s) € C^([0, t], M) satisfy 
71 (0) = y, 71 (t) = X and L(7i(s), 71(5), s)ds = and let 72(5) G C2([t,r],M) satisfy 

72(t) = X, 72(7") = z and Jg ^(72(5), 72(5), s)(is = cf{x,z). The curve 7 : [0,T] — ^ M obtained 
by pasting 71 and 72 clearly satisfies Jq L{'y{s),'y{s), s)ds = CQ{y,z), it is thus a minimizer, 
and belongs to T{(f)o, As a consequence, we have {x,t) £ T{(j)o, cpi). 
Conversely, we have: 

Lemma 20. If v is a viscosity sub-solution of (HJ) satisfying vq = 4>q and vt = (t^i, then 
u ^ V ^ u. If {x, t) G T((j)Q, (f)i), then we have v{x, t) = u{x, t). 

Proof. The inequality u ^u\s easy. For example, for a given point (x, t) there exists y in 
M such that u{x,t) = (poiv) + CQ{y,x), and for this value of y, we have v{x,t) ^ 4>o{y) + Co(y,x), 
hence v{x,t) ^ u{x,t). The proof that u ^ v is similar. In order to prove the second part of the 
lemma, it is enough to prove that v{'-f{t),t) = u{'~f{t),t) for each curve 7 G J^{(j)o, cpi). Since v is 
a sub-solution, we have 

vij{T),T) ^vijit),t) + cfij{t),jiT)). 

On the other hand, we have 

v{-f{t),t) ^ u{j{t),t) < n(7(0),0) +4(7(0), 7(t)). 
As a consequence of all these inequalities, we have 

MliT)) = v{7iT),T) ^ u{jiO),0) + 4(7(0), 7(t)) + cf i^it),j{T)) ^ 0o(7(O)) + 0^(7(0), 7(T)) 

which is an equality because 7 G J^{(j)o, Hence all the inequalities involved are equalities, 
and we have v{'y(t),t) = u{'y{t),t). q 
The end of the proof of the proposition is straightforward. Point 2 follows from Proposition [13] 
applied to the locally semi-concave functions u and —u. Point 3 follows from Proposition I15[ q 



3.3 Optimal sub-solution 

The following result, on which a large part of the present paper is based, is inspired from 
but seems new in the present context. 

Proposition 21. We have 

Cq (/io, ^t) = max ( / vrd^T - / -vod/Uo ) , 

where the maximum is taken on the set of Lip schitz functions v : M x [0,T] — > M which are 
on Mx]0, T[ and satisfy the inequality 

dtv{x,t) + H{x,d^v{x,t),t) (8) 

at each point {x,t) G Mx]0,T[. 

Proof. First, let v{x,t) be a continuous function of M x [0, T] which is differentiable on 
Mx]0,r[, where it satisfies ([S]). We then have, for each curve 7(t) : [0,r] — > M, the 
inequality 

L{j{t),^{t),t)dt^ [ d^v{^it),t)-^{t)-H{j{t),vi^{t),t),t)dt 
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^ r d,v{j{t),t) ■ j{t) + dtv{j{t),t)dt = v{j{T),T) - v{j{0),0). 







As a consequence, we get v{y,T) — v{x,0) ^ c^[x,y) for each x and y, so that 



The converse fohows directly from the next theorem, which is an analog in our context of the 
main result of ^21j. q 

Theorem 3. For each Kantorovich optimal pair {(j)Q,(pi), there exists a Lip schitz function v : 
M X [0,r] — >R which is on Mx]0,T[, which coincides with u on M x {0, T} U T((/)o, 
and which satisfies the inequality ^ strictly at each point of Mx]0,T[—T{(j)Q,(j)i). 

Proof. The proof of [21\ can't be translated to our context in a straightforward way. Our 
proof is different, and, we believe, simpler. It is based on: 

Proposition 22. There exists a function V S C^(M x [0, T],M) which is null on T(0o,</'i) and 
which is positive on Mx]0, T[— T(0o, 0i), and such that 

My) = min + r L{j{t),i{t),t) - V{j{t),t)dt. (9) 

7{T)=y Jo 

Proof. Let us define the norm 

\Wh = 11"" ° ^llc2(Bix[0,T],M) 

6>ee 

of functions u G C'^{M x [0, T],M), where B is the atlas of M defined in the Appendix. Let us 
denote by U the open set Mx]0,T[—T{cj}Q,(j)i). We need a Lemma. 

Lemma 23. Let Ui C U be an open set whose closure Ui is compact and contained in U, and let 
e > be given. There exists a function V\ E C'^{M x [0, T], M), which is positive on Ui and null 
outside of Ui which is such that the equality holds with V = Vi, and such that ||Vi||2 ^ e. 

Proof. Let us fix the open set Ui, the pair ((pQ, (pi) and y G M. We claim that the minimum 
in 



min MliO))+ / L{j{t),j{t),t)-Vi{j{t),t)dt 
i(T)=y Jo 

is reached at a path 7 the graph of which does not meet Ui, provided that the function Vi is 
supported in Ui and is sufficiently small in the topology. In order to prove the claim, suppose 
the contrary. There exists a sequence (n G N) and a sequence 7^ such that 

min 0o(7(O)) + / Lij{t),jit),t) - Vn{l{t),t)dt 



l{T)=y Jo 

is reached at 7„, the graph of 7„ meets Ui, Vn is supported in C/i (for all n G N) and ^ in 
the topology. As a consequence each 7„ is and the sequence 7n {n G N) is a minimizing 
sequence for 

</.i(2/) = min ,/.o(7(0)) + / L(7(t), 7(t), (10) 
l(T)=y Jo 

Hence this sequence is compact for the topology and, extracting a subsequence if needed, 
can be assumed to converge to some 700- Clearly 700 is a minimizer for (llOp the graph of which 
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meets Ui. This is a contradiction with Ui C U = Adx]0,T[\T{(j)Q,(pi) and the fact that the 
graph of 7oo is included in T((/)o,</'i) (see Definition [T8|) . □ 

Let [/„ C [/, n E N be a countable sequence of open sets covering U and whose closures {/„ 
are contained in U. There exists a sequence Vn of functions of C^{M x [0,r],R) such that, for 
each n S N : 

• The function Vn is positive in Un and null outside of Un- 

• We have {{Ynh < 2-"e. 

• The equality ([9]) holds for the function = ^27=1 ^■ 

Such a sequence can be build inductively by applying the lemma to the Lagrangian L — V^~^ 
with en = 2~"e. Since \\Vn\\ ^ 2~"e, the sequence is converging in norm to a limit 
V € C'^{M X [0, T],E). This function V satisfies the desired properties. The proposition is 
proved. □ 

In order to finish the proof of the theorem, we shall consider the new Lagrangian L = L — V , 
and the associated Hamiltonian H = H + V, as well as the associated cost functions c*. Let 

u{x, t) := min (poiy) + Co(y, x), 

be the viscosity solution of the Hamilton-Jacobi equation 

dtu + H{x,d^u,t) = -V{x,t) (HJ) 

emanating from (f)Q. The equality Q says that ut = (t>i = ut- The function u is Lipschitz on 
M X [0, T], as a viscosity solution of {HJ) emanating from a Lipschitz function. It is obviously 
a viscosity sub-solution of the equation (HJ), which is strict outside of M x {0, T} U T{(j)Q, 0i) 
(where V is positive). This means that the inequality ([8]) is strict at each point of differentia- 
bility of u outside of M x {0,T} U T{(j)Q,4ii). We have n ^ tt ^ n, this relation being satisfied 
by each viscosity sub-solution of (HJ) which satisfies uq = (po and ut = (pi- As a consequence, 
we have u = u = u on T(0o, 0i), and the function u is differentiable at each point of T{(I)q, (pi). 
Furthermore, we have du = du = du on this set. We then obtain the desired function v of the 
theorem from the function u by regularisation, applying Theorem 9.2 of ^21j. q 



4 Optimal objects of the direct problems 

We prove Theorem A as well as the results of section [2j The following lemma generalizes a result 
of Benamou and Brenier, see [6]. 

Lemma 24. We have the equality 

Cq {hq, ^t) = min A(mo) = min A{m) = min ^(x)- 

moeX(/io,AtT) meMifKhfiT) X&C{p.o,^iT) 

Moreover xidv) = A(x) for every optimal x, where v is given by Theorem\^ 

Proof. In view of Lemma [TT| it is enough to prove that, for each transport current x ^ 
C{^o,Ijt), we have A{x) ^ C(f (/Uq, /xt). Let v : M x [0,T] — > M be a Lipschitz sub-solution of 
(HJ) which is on Mx]0, r[, and such that {vq,vt) is a Kantorovich optimal pair. For each 
current x € C(//0)Mr); we have A{x) ^ x{dv) = Cq {^q, ixt), which ends the proof. □ 

From now on we choose and fix: 
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• An optimal Kantorovich pair {(po,4>i). 

• A Lipschitz sub-solution v : M x [0, T] — > M of the Hamilton- Jacobi equation which 
satisfies vq = (po and vt = (pi and which is on Aix]0,T[. 

• A bounded vectorfield X{x,t) : Mx]0,T[ — y TM which is locally Lipschitz and satisfies 

X{x,t) =dpH{x,d^v{x,t),t) on T{(f)o,^i). (11) 

4.1 Chciracterization of optimal currents. 

Each optimal transport current x can be written 

X = {X, 1) A n^, 

with a measure concentrated on T((/>o,0i). The current x i^ then Lipschitz regular, so that 
there exists a transport interpolation iit,t £ [0,T] such that = iJ^t ® dt (see Appendix) and 
such that fj4 = for each s and t in ]0, r[. 

Proof. Let x be an optimal transport current, that is a transport current x ^ C(/xo,A*t) such 
that A{x) = Cq{^q,ijlt)- Let us recall the definition of the action A{x) that will be used here: 

A{x)= sup (x(t^^,0)- / H{x,io''{x,t),t)dn^). 

uien^' ^ Jmx[o,t] ' 

Since -ff (a;, dxV^ t) + dtv ^ 0, we have 

^(x) = X{dv) ^ xidv) - J H{x, dxv{x, t),t) + dtvdn^ = xi^xV, ^) - j H{x, dxv{x, t), t)dfj,^. 

The other inequality holds by the definition of A, so that 

X{dv) =x{dv) - J H{x,dxv{x,t),t) + dtvdn^ = x{dxV,0) - J H{x,dxv{x,t),t)diJ,^, 

and we conclude that the function H{x,dxv{x,t),t) + dfV vanishes on the support of fx^, or in 
other words that the measure fi^ is concentrated on the set T{(f)o, cpi). In addition, for all form 
a; = o;^ + u^dt, we have 

Xidxv + w^, 0) - y H{x, dxv + w^, t)dn^ ^ xidxv, 0) - J H{x, d^v, t)dn^ = A{x)- 

Hence the equality 

X(w'^,0) = J dpH{x,dxV,t){Lo'')diJ,^ 
holds for each form to. This equality can be rewritten 

X(a;) = J dpH{x,dxV,t){u'') + iJdii^ 

which is precisely saying that 

X = {dpH{x,dxv{x,t),t),l) = {X,l) A fi^. 

The last equality follows from the fact that the vectorfields X and dpH{x,dxv{x,t),t) are equal 
on the support of /i^. By the structure of Lipschitz regular transport currents, we obtain the 
existence of a continuous family G [0, T] of probability measures such that /j.^ = nt 'Si dt 
and such that fit = i^Dtf^s for each s and t in ]0,T[. Since the restriction to a subinterval 
[s, t] C [0, r] of an optimal transport current x is clearly an optimal transport current for the 
transportation problem between //^ and i^t with cost c* , we obtain that the path fit is a transport 
interpolation. q 
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4.2 Characterization of transport interpolations. 

Each transport interpolation fit satisfies 

for each (s,t) g]0, Tp. The mapping 

fit I — > {X, 1) A {fit ® dt) 
is a bijection between the set of transport interpolations and the set of optimal transport currents. 

Proof. We fix a transport interpolation fit and two times s < s' in ]0, T[. Let xi be a transport 
current on M x [0, s] between the measures fiQ and fis whicli is optimal for the cost eg, let X2 be 
a transport current on M x [s, s'] between the measures fig and fis' which is optimal for the cost 
Cg and let xs be a transport current on M x [s', T] between the measures fig' and fiT which is 
optimal for the cost cj. Then the current x on M x [0, T] which coincides with xi on M x [0, s], 
with X2 on M X [s,s'] and with X3 on [s',T] belongs to C(//0)Mr)- In addition, since fit is a 
transport interpolation, we have 

Mx) = CQ{fio,fis) + Cf {fis, fis') + Cf,{fis',fiT) = Cq {fio,fiT)- 

Hence x is an optimal transport current for the cost Cg • In view of the characterisation of 
optimal currents, this implies that x = {X, 1) A fi-^, and that 

Mx = ((^s)ttM^) ®dt= {{^\')il^s') ^ dt. 

By uniqueness of the continuous desintegration of fi-^, we obtain that, for each t g]0, T[, 
(^'*)tt/is = (^'*,)jj/is/, and since this holds for ah s and s', that {^D^fis = fit for all {s,t) G]0,Tp. 
It follows that X = {X, 1) A {fit ^ dt). We have proved that the mapping 

fit I — > {X, 1) A {fit dt) 

associates an optimal transport current to each transport interpolation. This mapping is obvi- 
ously injective, and it is surjective in view of the characterization of optimal currents. □ 

4.3 Characterization of optimal measures. 

The mapping 

XI >{X X T)^fi^ 

is a bijection between the set of optimal transport currents and the set of optimal transport 
measures (t : M x [0, T] — > [0, T] is the projection on the second factor; see Appendix). Each 
optimal transport measure is thus invariant (see (jH) and Definition\^\). The mapping 

mo' — > fit = {tto TpQ)imo 

is a bijection between the set of optimal initial measures mo and the set of interpolations. An 
invariant measure m is optimal if and only if it is supported on the set T((/>o, 4>i). 

Proof. If m is an optimal transport measure, then the associated current Xm is an optimal 
transport current, and A{m) = A{xm)- Let fim be the time component of Xm, which is also the 
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measure (7rxr)(jm. In view of the characterization of optimal currents, we have Xm = l)^A^m- 
We claim that the equality A{xm) = A(m) implies that m is supported on the graph of X. 
Indeed, we have the pointwise inequality 

d^v{x, t)-V - H{x, d^v{x, t),t) ^ L{x, V, t) (12) 

for each {x^V,t) € TMx]0,r[. Integrating with respect to m, we get the equality 



MXm) = Xmidv) = I d:^v{x,t) -V + dtv{x,t)dm{x,V,t) 

Jtmx[o,t] 

= / d^v{x,t) - V - H{x,dr,v{x,t),t)dm{x,V,t) = / L{x,V,t)dm{x,V,t) = A{m), 

JtMx[0,T] Jmx[0,T] 

which means that m is concentrated on the set where the inequality p2|) is an equality, that is 
on the graph of the vectorfield dpH{x, dxv{x, t),t). Since is supported on T, the measure m 
is supported on T and satisfies m = {X x T)^firn- Let /if be the transport interpolation such 
that Hm = fJ't'^ dt. Setting mt = {Xt)^fit, we have m = rrit® dt. Observing that the relation 

Xf o ^* = ^* o X, 
holds on 7^, we conclude, since (ig is supported on 7^, that 

('i/'s)t)"T-s = mt, 

which means that the measure m is invariant. 

Conversely, let ra = mt® dthe an invariant measure supported on T{(pQ, We have 

A{m) = / / L{x,v,t)dmt{x,v)dt = / / L{{'il^Q{x,v),t)dmQ{x,v)dt, 



JTM Jo JTM 



and by Fubini, 

l-T 

A{m) 



I L{{ipQ{x,v),t)dtdmo{x,v) = / (j)i{TT o ipQ (^x,v)) — (f)o{x)dmQ{x,v), 
Jtm 



ITM 

and since tuq is an initial transport measure, we get 



JTM JTM 



□ 



5 Absolute continuity 

In this section, we make the additional assumption that the initial measure is absolutely 
continuous, and prove Theorem B. The following lemma answers a question asked to us by 
Cedric Villani. 

Lemma 25. // hq or ht is absolutely continuous with respect to the Lebesgue class, then each 
interpolating measure fit,t g]0, T[, is absolutely continuous. 
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Proof. If ij,t,t G [0,r] is a transport interpolation, we have proved that 

tj,t = {tt o ipl o Xs)f,Hs 

for each s €]0, r[, and t G [0,T]. Since the function tt o -i/;! o is Lipschitz, it maps Lebesgue 
zero measure sets into Lebesgue zero measure sets, and so it transport singular measures into 
singular measures. It follows that if, for some s g]0,T[, the measure /Xs is not absolutely con- 
tinuous, then none of the measures /xt,t G [0, T] are absolutely continuous. □ 

In order to continue the investigation of the specific properties satisfied when hq is absolutely 
continuous, we first need some more general results. Let {(j)Q,(j)i) be an optimal Kantorovich 
pair for the measures and and for the cost Cq . Recall that we have defined J^((/>o, </>i) C 
C^([0,T],M) as the the set of curves 7(t) such that 

MliT)) = Mim + r Lij{t),j{t),t)dt. 

Jo 

Let J-q{(Po, (pi) be the set of initial velocities (x, v) G TM such that the curve t i — > vr o ^/'g(x, v) 
belongs to J^(0O)^i)- Note that there is a natural bijection between J^oifpOjCpi) and J^(0O)0i)- 

Lemma 26. The set J-o{(f)o,(f>i) is compact. The maps vr and vr o tjj'^ : JFq ((/iq , (/>i ) — > M are 
surjective. If x is a point of differentiability of (po, then the sei 7r^^(a;) H J^o(0O) 0i) contains one 
and only one point. There exists a Borel measurable set c M of full measure, whose points 
are points of differentiability of (po, and such that the map 

X I — y S{x) = n~^{x) n J"o(0o,^^i) 

is Borel measurable on S. 

Proof. The compactness of .?7)(0O)0i) follows from the fact, already mentioned, that the set 
of minimizing extremals 7 : [0, T] — > M is compact for the C^- topology. 

It is equivalent to say that the projection vr restricted to J-'q{(Po, (pi) is surjective, and to say 
that, for each point x G M, there exists a curve emanating from x in J^((pQ,(pi). In order to 
build such curves, recall that 

(Po{x) = max</.i(7(T)) - / L{-f{t,^it),t)dt 

T Jo 

where the maximum is taken on the set of cTirvcs which satisfy 7(0) = x. Any maximizing curve 
is then a curve of T{(po, (pi) which satisfies 7(0) = x. In order to prove that the map tt o ip"^ 
restricted to J-'o{(po,4'i) is surjective, it is sufficient to build, for each point x G M, a curve in 
J^{(po, (pi) which ends at x. Such a curve is obtained as a minimizer in the expression 

(Piix) = min0o(7(O)) + / L{j{t,jit),t)dt. 

T Jo 

Now let us consider a point x of differentiability of (po. Applying the general result on the 
differentiability of viscosity solutions to the Backward viscosity solution u, we get that there 
exists a unique maximizer to the problem 

(Poix) = max(/.i(7(r)) - / L{j{t,'y{t),t)dt 

T Jo 
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and that this maximizer is the extremal with initial condition {x , dpH (x , d(pQ{x) , 0)) . As a 
consequence, there exists one and only one point S{x) in J^Q{(j)Q, 0i) above x, and in addition we 
have the explicit expression 

S{x) = dpH{x,d(t>o{x),0). 

Since the set of points of differentiability of (po has total Lebesgue measure -because (j)o is 
Lipschitz- there exists a sequence of compact sets such that (po is differentiable at each point 
of Kfi and such that the Lebesgue measure of M — Kn is converging to zero. For each n, the 
set ■K~^{Kn) n To{4>o, 4>i) is compact, and the restriction to this set of the canonical projection 
vr is injective and continuous. It follows that the inverse function S is continuous on Kn- As a 
consequence, the map S is Borel measurable on S := U„^C„. □ 

Lemma 27. The initial transport measure ttiq is optimal if and only if it is an initial transport 
measure supported on .Fo(</>o, ^i)- 

Proof. This statement is a reformulation of the result in 14.31 stating that the optimal transport 
measures are the invariant measures supported on T(0o, </>i). □ 

Proposition 28. If fj,Q is absolutely continuous, then there exists a unique optimal initial 
measure rriQ. There exists a Borel section S : M — TAf of the canonical projection such 
that tuq = S^hq, this section is unique ^Q-almost everywhere. For each t € [0,T], the map 
TT o iPqO S : M — > M is then an optimal transport map between hq and ■ 

Proof. Let S : S — > TM be the Borel map constructed in Lemma [26l For convenience, we 
shall also denote by S the same map extended by zero outside of S, which is a Borel section 
S : M — > TM. Since the set S is of full Lebesgue measure, and since the measure is ab- 
solutely continuous, we have /Uo(S) = 1. Let us consider the measure mo = 5'j(/Uo|e)- This is a 
probability measure on TM, which is concentrated on ^^{(po, cpi), and which satisfies tt^tuq = fiQ. 
We claim that it is the only measure with these properties. Indeed, if mo is a measure with 
these properties, then TT^rho = /io, hence the measure mo is concentrated on 7r~^(S) n.Fo(</>o, (pi)- 
But then, since vr induces a Borel isomorphism from 7r~^(S) D .Fo((/)o, ^i) onto its image S, of 
inverse S, we must have mo = S^fiQ. As a consequence, the measure mo = S^hq is the only can- 
didate to be an optimal initial transport measure. Since we have already proved the existence 
of an optimal initial transport measure, it implies that mo is the only optimal initial transport 
measure. Of course, we could prove directly that mo is an initial transport measure, but as we 
have seen, it is not necessary. r-i 



5.1 Remark 

That there exists an optimal transport map if ^uq is absolutely continuous could be proved 
directly as a consequence of the following properties of the cost function. 

Lemma 29. The cost function c(^{x,y) is semi-concave on M x M. In addition, we have the 
following injectivity property for each x € M: If the differentials dxC^{x, y) and dxC^{x, y') exist 
and are equal, then y = y' . 

In view of these properties of the cost function, it is not hard to prove the following lemma 
using a Kantorovich optimal pair in the spirit of works of Brenier [12] and Carlier [16] . 

Lemma 30. There exists a compact subset K ^ M x M , such that the fiber = K D 7rQ""'^(x) 
contains one and only one point for Lebesgue almost every x, and which contains the support of 
all optimal plans. 
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The proof of the existence of an optimal map for an absolutely continuous measure can 
then be terminated using the following result, see p^, Proposition 2.1. 

Proposition 31. A transport plan r] is induced from a transport map if and only if it is con- 
centrated on a r]-measurable graph. 

5.2 Remark 

Assuming only that fiQ vanishes on countably {d — l)-rectifiable sets, we can conclude that the 
same property holds for all interpolating measures /ij, i < T, and that Proposition [28] hold. This 
is proved almost identically. The only refinement needed is that the set of singular points of the 
semi-convex function <j)Q is a countably (d — l)-rectifiable, see [11]. 

6 Aubry-Mather theory 

We explain the relations between the results obtained so far and Mather theory, and prove 
Theorem C. Up to now, we have worked with fixed measures iiq and fix- Let us study the 
optimal value Cq {fiQ, ht) as a function of the measures fiQ and ixt- 

Lemma 32. The function 

(/^o,/"t) I — > Cq{ho,ht) 
is convex and lower semi- continuous on the set of pairs of probability measures on M. 

Proof. It follows directly from the expression 

Cq{hq,iit)= max / ^id/^y - / (f^odno 
{'t>o,<f>i) Jm jm 

as a maximum of continuous linear functions. □ 

From now on, we consider that the Lagrangian L is defined for all times, L G C'^{TM x M, M), 
and satisfies 

L(x, t + 1) = L(a;, v, t) 

in addition to the standing hypotheses. Let us restate Theorem C with more details. Recall 
that a is the action of Mather measures, as defined in the introduction. 

Theorem C. There exists a Lipschitz vectorfield Xq on M such that all the Mather measures 
are supported on the graph of Xq. We have 

a = minCo(/i, /i), 

where the minimum is taken on the set of probability measures on M . The mapping mo i — > 
(7r)(|mo is a bijection between the set of Mather measures mo and the set of probability measures 
H on M satisfying CQ(/i,/i) = a. More precisely, if n is such a probability measure, then there 
exists one and only one initial transport measure mo for the transport problem between fiQ = fi 
and Hi = fi with cost cj, this measure is tuq = {Xq)^h, and it is a Mather measure. 

The proof, and related digressions, occupy the end of the section. 
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Lemma 33. The following minima 

/ieBi(M) 1 

exist and are all equal. In addition, any measure ji^ G Bi{M) which is minimizing Cq{ii,ijl) is 
also minimizing Co"(/x,/x) for all T G N. 

Proof. The existence of the minima follows from the compactness of the set of probability 
measures and from the semi-continuity of the function Cq. Let /i^ be a minimizing measure 
for tti and let m} be an optimal transport measure for the transportation problem Co(//^,//^). 
Let be the measure on TM x [0,T] obtained by concatenating T translated versions of m}. 
It means that rrv^ is the only measure on TM x [0,T] whose restriction to TM x + 1] is 
obtained by translation from m, for each integer i. It is easy to check that is indeed a 
transport measure between the jiQ = ij} and ht = A*^ on the times interval [0,T], and that 
A^im^) = rAj(m^). As a consequence, we have 

TaT ^ C^{l^\l^^) ^ A^im^) = TC^{pi\ii^) = Ta^ 

which implies the inequality ax ^ cei- 

Let us now prove that ax ^ cti- In order to do so, we consider an optimal measure fj?" for 
ax, and consider a transport interpolation G [0,T] between the measures //q = M"^ and 

Ht = ■ Let us then consider, for t G [0, 1], the measure 

l^t '■— J, 2^ l^t+ii 
i=0 

and note that Tjl^ = /Xq + Yljs^ /jj = /i^ + Yli=i I^J — ^Af- view of the convexity of the 
function Cq 

\ j=0 / i=0 

Since ft^ = pj, this implies that ai ^ ar, as desired. □ 



Lemma 34. We have ai ^ a. 

Proof. If mo is a Mather measure, then it is an initial measure for the transport prob- 
lem between = ('^)tj"^o and ni = {■K)^mo for the cost Cq. As a consequence, we have 
a = Aj(mo) ^ C(J(//o,Mo) ^ «!• □ 

Lemma 35. Let fi^ be a probability measure on M such that Cd(/i^,/n^) = ai. Then there 
exists a unique initial transport measure tuq for the transportation problem between jjiQ = and 

111 = /i^ for the cost Cg. This measure satisfies (^Q)jmo = m,Q. We have ai = AQ{mQ) ^ a, so 
that a = ai and thq is a Mather measure. There exists a constant K, which depends only on L, 
such that the measure niQ is supported on the graph of a K -Lipschitz vectorfield. 

Proof. Let us fix a probability measure /x^ on M such that Co(/i^,;U^) = ai. Let X : 
M X [0, 2] — > TM be a vectorfield associated to the transport problem Cq (/Li^, /x^) by Theorem 
A. Note that Xi is Lipschitz on M with a Lipschitz constant K which does not depend on /ii. 
We choose X once and for all and then fix it. 
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To each optimal transport measure for the transport problem Cg(//^,/i^), we associate 
the transport measure on TM x [0,2] obtained by concatenation of two translated versions 
of m^, as in the proof of Lemma [33j We have 

Alim"^) = 2Al{m^) = 2ai = 2a2 = C^{fi\fi^). 

The measure m? is thus an optimal transport measure for the transportation problem Cq (/x^, /x^). 
Let mt,t £ [0, 2] be the continuous family of probability measures on TM such that = mt^dt. 
Note that mt = {ipD^nis for all s and t in [0, 2], and that uiq is the initial transport measure for 
the transportation problem CQ(/i^,/x^) associated to m^. Since the measure m? was obtained 
by concatenation of two translated versions of the same measure m^, we must have mt+i = mt 
for almost all t g]0, 1[, and, by continuity, mo = rrii = m2. This implies that mo = {tpQ)^mQ. 
Finally, the characterization of optimal measures implies that mo = mi = We have 

proved that the measure {Xi)^n^ is the only optimal initial transport measure for the trans- 
portation problem Co(/i^, /x^). Q 

Proof of the theorem. Let mo be a Mather measure, and let fio = vr^mo. Note that we also 
have ^0 = (tt o ipQ)^mQ. As a consequence, mo is an initial transport measure for the transport 
between fiQ and fiQ for the cost cj, and we have 

Q = Ao(mo) ^ CQ(/io,/io) ^ Oil. 

Since ai = a, all these inequalities are equalities, so that mo is an optimal initial transport, and 
Co(/io, A'o) = Oil- It follows from Lemma [35] that mo is supported on the graph of a ii'-Lipschitz 
vectorfield. 

Up to now, we have proved that each Mather measure is supported on the graph of a K- 
Lipschitz vectorfield. There remains to prove that all Mather measures are supported on a 
single i^-Lipschitz graph. In order to prove this, let us denote by C TM the union of the 
supports of Mather measures. If {x,v) and {x',v') are two points of Ai, then there exists a 
Mather measure mo whose support contains (x, v) and a measure niQ whose support contains 
(x' , v'). But then the measure (mo + mo)/2 is clearly a Mather measure whose support contains 
{{x,v), {x' ,v')} and is itself included in the graph of a iC-Lipschitz vectorfield. Assuming that 
X and x' lie in the image 9{Bi) of a common chart, see appendix, so that (x, v) = d6{X, V) and 
{x',v') = de{X',V'), we obtain 

It follows that the restriction to Ai of the canonical projection TM — > M is a bi-Lipschitz home- 
omorphism, or equivalently that the set Ai is contained in the graph of a Lipschitz vectorfield. q 



A Notations and standing conventions 

• M is a compact manifold of dimension d, and vr : TM — > M is the canonical projection. 

• We denote by r : TM x [0, T] — > [0, T] or M x [0, T] — > [0, T] the projection on the 
second factor. 

• If is any separable, complete, locally compact metric space (for example M, M x [0,T], 
TM or TM x [0,r])) the sets Bi{N) C B+{N) C B{N) are respectively the set of Borel 
probability measures, non-negative Borel finite measures, and finite Borel signed measures. 
If Cc{N) is the set of continuous compactly supported functions on N, endowed with 
the topology of uniform convergence, then the space B{N) is identified with the set of 
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continuous linear forms on Cc{N) by the Riesz theorem. We will always endow the space 
B{N) with the weak-* topology that we will also call the weak topology. Note that the 
set Bi{N) is compact if N is. Prohorov's theorem states that a sequence of probability 
measures P„ G Bi{N) has a subsequence converging in Bi{N) for the weak-* topology if 
for all e > there exists a compact set such that PniN — K^) ^ e for all n G N. See 
e.g. [Mum [TO]. 

Given two manifolds N and N' , a Borel application F : — > N', and a measure /U G 
B{N), we define the push- forward Ff^fi of /i by F as the unique measure on A^' which 
satisfies 

for all Borel set G iV, or equivalently 

/ fdiF^fi) = [ foFdfi 
Jn' Jn 

for all continuous function / : N' — > R. 

A family /i^, t G [0, T] of measures in B{N) is called measurable if the map 1 1 — > ftd^t is 
Borel measurable for each / G Cc{N x [0, T]). We define the measure nt'^dt on N x [0, T] 

by 

/ fd{fit (^dt)= / ftdfit dt 

JNy.[0,T] Jo JN 

for each / G Cc{N x [0,r]). The well-known desintegration theorem states that, if is a 
measure on x [0, T] such that the projected measure on [0, T] is the Lebesgue measure 
dt, then there exists a measurable family of measures fit on such that ii = iit® dt. 

The set IC{fj,o,fiT) of transport plans is defined in section [L2l 

The set Z(/io,^T) of initial transport measures is defined in section [2Tl 

The set A4{fio,fiT) of transport measures is defined in section [2Tl 

The set C{^q,^t) of transport currents is defined in section [2^21 

We fix, once and for all, a finite atlas of M, formed by charts 9 : — > M, where Br 
is the open ball of radius r centered at zero in R*^. We assume in addition that the sets 
e{Bi),e G e cover M. 

We say that a vectorfield X : M — > TM is i^'-Lipschitz if, for each chart G 0, the 
mapping 11 o [dO)^^ o X o 6 : B^ — > is if-Lipschitz on Bi, where 11 is the projection 
55 X R'^ — > W^. 

We mention the following results which are used through the paper : There exists a 
constant C such that, if A is a subset of M, and Xa '■ A — > TM is a JC-Lipschitz 
vectorfield, then there exists a CiT-Lipschitz vectorfield X on M which extends Xj^. In 
addition, if A is a subset of M x [0, T] and X^ '■ A — > TM is a i^-Lipschitz vectorfield, 
then there exists a Cif-Lipschitz vectorfield X on M x [0,T] which extends Xa- If A 
is a compact subset of M x [0,T] and Xa '■ A n Mx]0,r[ — > TM is a locally Lipschitz 
vectorfield (which is i^(e)-Lipschitz on j4 n M x [e,r — e]), then there exists a locally 
Lipschitz (which is C-ftr(e)-Lipschitz on M x [e,T — e]) vectorfield X on Mx]0,T[ which 
extends Xa, 
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